Characteristic Substructures and Properties in the Chemical Carcinogenicity Studied by the Cascade Model
نویسنده
چکیده
The cascade model is a rule induction methodology using the levelwise expansion of the lattice. An attribute-value pair is expressed as an item, and every node in the lattice is specified by an itemset and by its supporting instances. If the distribution of the class attribute values shows a large change along a link in the lattice, the link is represented as a rule "IF added-item-along-link added on itemset-on-upper-node, THEN class-i". The strength of the rule is measured by the BSS value of the link. In this study, we utilize linear substructure fragments and several physicochemical properties to describe a rule. A fragment leads to one of the two items [frag-i: y] and [frag-i: n] depending on whether or not the fragment exists in a molecule. Application of the cascade model to these items data set gives us rules about carcinogenicity. We could find several rules with large BSS values. Substructures and properties that appear in these rules are expected to provide a starting point for further chemical and biological study. Several rules with classification capability are used to predict the carcinogenicity for the compounds in the test set.
منابع مشابه
Characteristic Substructures and Properties in Chemical Carcinogens Studied by the Cascade Model
MOTIVATION Chemical carcinogenicity is an important subject in health and environmental sciences, and a reliable method is expected to identify characteristic factors for carcinogenicity. The predictive toxicology challenge (PTC) 2000-2001 has provided the opportunity for various data mining methods to evaluate their performance. The cascade model, a data mining method developed by the author, ...
متن کاملWarmr: a data mining tool for chemical data
Data mining techniques are becoming increasingly important in chemistry as databases become too large to examine manually. Data mining methods from the field of Inductive Logic Programming (ILP) have potential advantages for structural chemical data. In this paper we present Warmr, the first ILP data mining algorithm to be applied to chemoinformatic data. We illustrate the value of Warmr by app...
متن کاملModulation Response and Relative Intensity Noise Spectra in Quantum Cascade Lasers
Static properties, relatively intensity noise and intensity modulation response in quantum cascade lasers (QCLs) studied theoretically in this paper. The present rate equations model consists of three equations for the electrons density in the conduction band and one equation for photons density in cavity length. Two equations were derived to calculate the noise and modulation response. Calcula...
متن کاملMonitoring the censored lognormal reliability data in a three-stage process using AFT model
Improving the product reliability is the main concern in both manufacturing and service processes which is obtained by monitoring the reliability-related quality characteristics. Nowadays, products or services are the result of processes with dependent stages referred to as multistage processes. In these processes, the quality characteristic in each stage is affected by the quality characterist...
متن کاملA Numerical Investigation on the Unstable Flow in a Single Stage of an Axial Compressor
An unsteady two-dimensional finite-volume solver was developed based on Van Leer’s flux splitting algorithm in conjunction with “Monotonic Upstream Scheme for Conservation Laws (MUSCL)” limiters to improve the order of accuracy and the two-layer Baldwin-Lomax turbulence model was also implemented. Two test cases were prepared to validate the solver. The computed results were compared with the e...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001